Runtime characterisation of irregular accesses applied to parallelisation of irregular reductions

Authors

  • David E. Singh
  • María J. Martín
  • Francisco F. Rivera
Abstract

Irregular reduction operations are the core of many large scientific and engineering applications. There are, in the literature, different methods to solve these operations in parallel. In this paper we discuss a new technique which improves performance significantly, both in terms of execution time and memory overhead. These improvements are achieved in the preprocessing as well as in the resulting parallel code. Our proposal is based on the use of the Irregular Access Region Descriptor (IARD). This data structure is a compact characterisation of indirectly accessed arrays that can be used for the efficient parallelisation of a wide spectrum of irregular codes. In this paper we present its application to parallelise irregular reduction operations on a shared memory machine. We compare our proposal with two other competitive solutions.
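For readers unfamiliar with the term, the sketch below shows what an irregular reduction looks like and one well-known way to run it in parallel. It is not the IARD technique of the paper, whose details the abstract does not give. It illustrates the array-replication approach found in the literature, in which each worker accumulates into a private copy of the reduction array and the copies are summed at the end. All names (`sequential_reduction`, `replicated_reduction`, `idx`, `val`, `n_workers`) are hypothetical, and Python threads are used only to show the structure, not to obtain a speedup.

```python
from concurrent.futures import ThreadPoolExecutor


def sequential_reduction(n_out, idx, val):
    """Irregular reduction: the written element is chosen by an index array."""
    out = [0.0] * n_out
    for i in range(len(idx)):
        # idx[i] is only known at run time, so two iterations may hit the
        # same element of `out`; this is what prevents a naive parallel loop.
        out[idx[i]] += val[i]
    return out


def replicated_reduction(n_out, idx, val, n_workers=4):
    """Illustrative array-replication scheme (an assumption, not the IARD method)."""
    n = len(idx)
    chunk = (n + n_workers - 1) // n_workers  # iterations per worker, rounded up

    def partial(w):
        # Each worker owns a full private copy of the reduction array,
        # so no synchronisation is needed while accumulating.
        private = [0.0] * n_out
        for i in range(w * chunk, min((w + 1) * chunk, n)):
            private[idx[i]] += val[i]
        return private

    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(partial, range(n_workers)))

    # Final combination step: sum the private copies element by element.
    return [sum(column) for column in zip(*partials)]
```

The private copies cost `n_workers * n_out` elements of extra memory, which is the kind of memory overhead the abstract refers to when comparing parallelisation methods.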

Similar resources

Memory System Support for Irregular Applications

Because irregular applications have unpredictable memory access patterns, their performance is dominated by memory behavior. The Impulse configurable memory controller will enable significant performance improvements for irregular applications, because it can be configured to optimize memory accesses on an application-by-application basis. In this paper we describe the optimizations that the Impuls...

Processing irregular codes containing arrays with multi-dimensional distributions by the PREPARE HPF compiler

The automatic parallelization of irregular codes for massively parallel systems is a challenging problem of growing importance. This paper describes techniques for processing irregular codes that are being implemented in the Prepare HPF compiler. Our work makes two contributions: (1) it constructs a library called Parti+ that is used for the runtime analysis of codes with runtime data dependent...

Evaluation of Compiler and Runtime Library Approaches for Supporting Parallel Regular Applications

Important applications including those in computational chemistry, computational fluid dynamics, structural analysis and sparse matrix applications usually consist of a mixture of regular and irregular accesses. While current state-of-the-art run-time library support for such applications handles the irregular accesses reasonably well, the efficacy of the optimizations at run-time for the regul...

Improving Compiler and Run-Time Support for Irregular Reductions Using Local Writes

Current compilers for distributed-memory multiprocessors parallelize irregular reductions either by generating calls to sophisticated run-time systems (CHAOS) or by relying on replicated buffers and the shared-memory interface supported by software DSMs (TreadMarks). We introduce LocalWrite, a new technique for parallelizing irregular reductions based on the owner-computes rule. It eliminates th...

Optimizing Irregular HPF Applications using Halos

This paper presents language features for High Performance Fortran (HPF) to specify non-local access patterns of distributed arrays, called halos, and to control the communication associated with these non-local accesses. Using these features, crucial optimization techniques required for an efficient parallelization of irregular applications may be applied. The information provided by halos is uti...

Journal title:
  • IJCSE

Volume 1  Issue 

Pages  -

Publication date 2005